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Abstract 

We prove that if A is a large random relational structure (with at 
least one relation of arity at least 2) then the homomorphism extension 
problem EXT(A) is almost surely NP-complete. 
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1 Introduction 

The complexity of the constraint satisfaction problem (CSP) with a fixed 
target structure is a well established field of study in combinatorics and 
computer science (see [1] for an overview). In the last decade, we have seen 
algebraic tools brought to bear on the question of CSP complexity, yielding 
major new results (see e.g. [2], [3], pQ). 

In the algebraic approach, it is customary to study relational structures 
that contain all possible constants. If A is such a structure and we are to 
decide the existence of a homomorphism / : IB — > A then the constant con- 
straints prescribe values for / at some vertices of B. We are thus deciding 
if some partial homomorphism f c : M — > A can be extended to the whole B. 
Therefore, CSP(A) becomes the homomorphism extension problem with tar- 
get structure A, denoted by EXT (A). It is easy to see that CSP (A) reduces 
to EXT (A), since in CSP we extend the empty partial homomorphism. 

In [5], the authors prove that CSP(A) is almost surely NP-complete for 
A large random relational structure with at least one at least binary relation 
and without loops. We show by a different method that the same hardness 
result holds for EXT(A) even if we allow loops. 



1 



2 Preliminaries 



A relational structure A is any set A together with a family of relations 
{Ri : i G 1} where Ri C A ni . We call the number rtj the arity of The 
sequence (rij : i G J) determines the similarity type of A. We consider only 
finite structures (and finitary relations) in this paper. We use the notation 
[n] = {1,2,..., n}. 

Let A = (A, {Ri : i G /}) and B = (J3, {Si : z G I}) be two rela- 
tional structures of the same similarity type. A mapping / : A — > B is 
a homomorphism if for every i G / and every . . . , a n .) G i?i we have 

(/(oi), • • • G 

Let us fix some p G (0, 1) and let A be a set. The relation S C A 1 is 
an Z-ary random relation on ^4 if every possible Z-tuple belongs to S with 
probability p (independently of other /-tuples). We will call any relational 
structure with one or more random relations a random relational structure. 
In particular, a random relational structure with just one binary relation is 
a random digraph. 

The Constraint Satisfaction Problem with the target structure A, de- 
noted by CSP(A), consists of deciding whether a given input relational struc- 
ture B of the same similarity type as A can be homomorphically mapped 
to A. It is easy to come up with examples of A such that CSP(A) is NP- 
complete and this is in a sense typical behavior as proved in [5j: If R(n, k) is 
a fc-ary random relation on the set [n] (with p = 1/2) that does not contain 
any elements of the form (a, a, ... , a) for a£i then 

Vk > 2, lim Prob(CSP([n],i?(ra,/c)) is NP-complete) = 1, (1) 

n— ¥oo 

Vn > 2, lim Prob(CSP ([n],R(n,k)) is NP-complete) = 1. (2) 

k— >oo 

There is a reason why the authors of [5] disallow loops: If A has only 
one relation R and R contains a loop (a, a, ... , a) then every B of the same 
similarity type as A can be homomorphically mapped to A simply by sending 
everything to a, so CSP(^4) is very simple to solve. 

Given a target structure A, the Homomorphism Extension Problem for 
A, denoted by EXT (A), consists of deciding whether a given input structure 
B and a given partial mapping / : B — >■ A can be extended to a homomor- 
phism from B to A. 

Let A be a set and a G A. The constant relation c a is the unary relation 
consisting only of a, i.e. c a = {(a)}. When searching for a homomorphism 
to A, the relation c a prescribes a set of elements of B that must be mapped 
to a. A little thought gives us that if A contains constant relations for each 
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of its elements (as is usual in the algebraic treatment of CSP) then CSP(A) 
and EXT (A) are essentially the same problem. 

Since the homomorphism extension problem is quite important to alge- 
braists, it makes sense to ask what is the typical complexity of EXT (A). We 
will use the phrase "EXT (A) is almost surely NP-compete for n large" as an 
abbreviation for "For each n £ N, there exists a random relational structure 
A n (whose precise definition is obvious from the context) such that we have 

lim Prob(EXT(A n ) is NP-complete) = 1." 

Because additional relations do not make CSP easier to solve, the limit 
([!]) gives us that that EXT(A) is almost surely NP-complete if A is a large 
random relational structure with no loops and at least one relation of arity 
greater than one. In the remainder of the paper we show that we can allow 
loops without making the problem any easier. 

3 The problem EXT for random digraphs 

We will begin by investigating random digraphs and then generalize our 
findings to all relational structures. 

Theorem 1. Let G be a random digraph on n vertices. Then EXT(G) is 
almost surely NP-complete for n large. 

Proof. Let G = (V, E) be a digraph. Understand G as a relational struc- 
ture and add to G every constant relation possible. Let v\, ...,V\ € V(G). 
Consider the set 

F Vl ,..., Vl = {u£ V(G) : Vi,(«i,«) G E(G)} 

We will call this set a subalgebra of G. 

For an interested reader, we note that sets F„ lv „ : „, are subalgebras in the 
universal algebraic sense and our technique can be greatly generalized to all 
primitive positive definitions (see [2]). For our proof, however, we need a 
lot less: Assume that for some choice of v\, . . . ,vi the subalgebra F Vlr .. )Vl 
induces a loopless triangle in G. We claim that we can then reduce graph 
3-colorability to EXT(G), making EXT(G) NP-complete. 

Let H be a graph whose 3-colorability we wish to test. We then under- 
stand H as a symmetric digraph and add to H new vertices w\ , . . . , wi and 
new edges (wi,u) for each i £ {1, . . . ,n} and all u € V(H), obtaining the 
digraph H' . Our EXT(G) instance will then consist of the digraph H' along 
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with the partial map / which maps each Wj to V{. Now / can be extended to 
a homomorphism if and only if H can be homomorphically mapped into the 
triangle induced by F vlt ___ tVl which happens if and only if H is 3-colorable. 

All we need to do now is to show that G almost surely contains a subal- 
gebra that induces a triangle. Our aim, roughly speaking, is to show that G 
almost surely contains many three element subalgebras because then there 
is a large chance that at least one of these subalgebras will be a triangle. 

We will partition V(G) into two sets A = {1, . . . , \n/2\ } and B = 
{\n/2},... , n}. We will now use points of A to define subalgebras lying 
in B. Denote by Sk the event "G contains at least k disjoint three-element 
subalgebras of the form F vlj „ mjVl C B for some v\, . . . , i>/ G A." We can write 

s k = (J Sa,...,C k , 

C\,...,C k (ZB 

yiytj,dnCj=$ 
Vi, |C;|=3 

where Sc lt ...,c k is the event "The sets Ci,...,Cfc are subalgebras of G" . 
Finally, denote by Tc x ,...,c k the event "There exists an i € {1, 2. . . . , k} such 
that the set Cj induces a triangle subgraph of G." 

Since a probability that a fixed Cj induces a triangle is p 6 (l — p 3 ), the 
probability of the event Tc lt ...,c k is (for Ci,...,Cfc pairwise disjoint three 
element sets) 

Prob(T Cli ..., Cfe ) = l-(l-p 6 (l-p 3 )) fc , 

which tends to 1 when k goes to infinity. 

Observe that the event Sc 1 ,...,c k is independent from the event Tc 1; ...,c k 
for each choice of C\, . . . , Ck C B since both events talk about disjoint sets 
of edges of G. 

Assume for a moment that for all k G N the value of Prob(Sfc) tends 
to 1 as n tends to infinity. Then, given an e > 0, we choose k so that 
Prob(Tc lv .. j c fe ) > 1 — e. When n is large enough, the digraph G contains 
some k pairwise disjoint three element subalgebras C±, . . . , with proba- 
bility at least 1 — e. The probability that one of the sets C±, . . . , them 
induces a triangle is Tc u ...,c k > 1 — £■ Thus we get an NP-complete CSP 
problem with probability at least (1 — e) 2 > 1 — 2e and since e was arbi- 
trary, we see that for large n the homomorphism extension problem is almost 
surely NP-complete. 

It remains to show lim ra _ i . 00 Prob(Sfc) = 1 for all k. Fix the value of k. 
For each value of n, let I be the integer satisfying np l > 1 > np l+1 . We will 
now search for the three element subalgebras of B for n large. We proceed in 
steps: Assume that after i steps we have already found m such subalgebras 
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C\, . . . , C m . In the (i + l)-th step, we take the vertices 1 + 2 + il, . . . , l + il 
of A and consider the subalgebra Fi + n j 2+u,...,l+il- If this subalgebra lies 
in B, has size three and is disjoint with all the sets C\, . . . ,C m , we let 
Cm+i = Fi+ii,2+ii,...,i+ih increase m by one and continue with the next step. 
Otherwise, i*i+ij,2+iz,. ..,i+u is n °t a good candidate for C m+ ±, so we leave m 
unchanged and continue with the next step. 

What is the probability that we find the (m+ l)-th subalgebra in a given 
step? Every vertex of G is in Fi + u > 2+u > ... : i+u with the probability p l . The 
probability that Fi + u t 2+u,...,i+u consists of three yet-unused vertices of B is 
then equal to 



, y i {i- P i ) n ' A > — — g — -p m (i-p 1 , 

If m > k, we have already won, so assume m < k: 

(n/2-3k) 3 v , 7x „ (l/2-3k/nf „ V/ , 

q > ^— - — V(i -p'r = — — g-^nVfi -p l r 

Now let r = C 1 / 2 "^/") anc [ ODserve that r > for n large enough. Using 
the the inequalities np l > 1 > np l+l we have: 



1 \ n 

> rn 3 p 3l (l - p l ) n > r(l - p l ) n >r[l 

\ pn 

The lower bound on q tends to r/e 1 ^ as n tends to infinity, so there exists 
a 5 such that q > 5 > for all n large enough. 

Therefore, the probability of producing a new three-element subalge- 
bra in a given step is at least 5 > and this bound does not depend on 
the number of subalgebras we have already found. Now observe that I is 
approximately log 1 / p n and therefore we have enough vertices in A for ap- 
proximately s = 2iog" / n s ^ e P s - If we choose n large enough, we can have s 
as large as we want and so the probability of finding at least k subalgebras 
can be arbitrarily close to 1. Therefore, lim n „ s . 00 Prob(S'fc) = 1, concluding 
the proof. □ 



4 Random relational structures 

It is easy to see that if A is a relational structure with unary relations only 
then EXT(A) is always polynomial. We would now like to investigate the 
case of relations of arity greater than two. Intuition tells us that greater 
arity means greater complexity. The intuition is right. 
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Lemma 2. Let I > 2, n be large and let A = ([n], S) be a relational structure 
with S a random l-ary relation. Then the homomorphism extension problem 
CSP(A) is almost surely NP-complete. 

Proof. We have proven the result for 1 = 2. If I > 2, consider the binary re- 
lational structure B = ([n],R) where R = {(x, y) S [n] 2 : (x, y, 1, 1, . . . , 1) € 
S}. It is easy to see that if S is a random Z-ary relation then B is a random 
digraph where each edge exists with the probability p. From Theorem [1] 
we see that EXT(B) is almost surely NP-complete. We will now show how 
to reduce EXT(B) to EXT(A) in polynomial time, proving that EXT(A) is 
almost surely NP-complete. 

Using algebraic tools, the reduction of EXT(B) to EXT(A) follows from 
the fact that R is defined by a primitive positive formula that uses only S and 
the constant 1. However, we will provide an elementary reduction here: Let 
C = (C, T) be a relational structure with a single binary relation T and let / : 
C — > [n] be a partial mapping. We add to C a new element e, construct the 
Z-ary relation U = {(x, y, e, e, . . . , e) : (x, y) £ T} and the partial mapping 
g : Cu{e} — > [n] so that g\c = / and g(e) = 1. A little thought gives us that 
g can be extended to a homomorphism (C U {e}, U) — > A if and only if / 
can be extended to a homomorphism (C, T) — > B, concluding the proof. □ 

Additional relations in A do not make EXT(A) easier, so we have the 
most general version of our NP-completeness result: 

Corollary 3. Let A be the relational structure ([n},{Ri : i E L}) where at 
least one Ri is a random relation of arity greater than one. Then EXT(A) 
is almost surely NP-complete for n large. 

As a final note, we will now prove the analogue of the limit ([2]) for EXT. 

Corollary 4. Let us fix a set A of at least two elements and let A = (^4, R) 
be a relational structure with R random k-ary relation. Then EXT(A) is 
almost surely NP-complete for k large. 

Proof. Assume first that k is even and let m = k/2. 
Consider the relational structure B = (A m , S) with 

S = {((ai, • • • , CLm), (a-m+l, ■ ■ ■ , «2m)) : • • • , «2m) G R}. 

It is straightforward to prove that S is a binary random relation on A m 
and therefore EXT(B) is almost surely NP-complete for large even k. What 
is more, EXT(B) can be easily reduced to EXT(A): If C = (C,T) is a 
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relational structure with T binary and / : C — > A m is a partial mapping, 
we construct the structure C = (C',T') with 

C = {(c, i) : c 6 C, i G {1, . . . , m}}, 

r / = {(( C ,l),...,( C ,m),( ( i,l),...,(d,m)) : (c,d) € T} 

and a partial mapping 5 : C — > A such that g(c,i) = ctj whenever /(c) is 
defined and equal to (a%, . . . , a m ). 

It is easy to see that g can be extended to a homomorphism from C to 
A if and only if / can be extended to a homomorphism from C to A. 

In the case that k = 2m + 1, we fix an e € A, choose IB = (A m , S) with 

S = {((dl, • • • , dtnji ( a rn+l, ■ ■ ■ , 0-2m) '■ • ■ ■ j 02m, e) G i?} 

and proceed similarly to the previous case. 

We see that for a large enough k, no matter if it is odd or even, the 
problem EXT(B) is almost surely NP-complete. □ 

5 Conclusions 

We have shown that the homomorphism extension problem is almost surely 
NP-complete for large relational structures (assuming we have at least one 
non- unary relation). In a sense, our result is not surprising since the rela- 
tional structures we consider are very dense, so it stands to a reason that 
we can find hard instances most of the time. 

It might therefore be interesting to see what is the complexity of CSP 
or EXT for large structures obtained by other random processes, particu- 
larly when relations are sparse. Such structures might better correspond to 
"typical" cases of CSP or EXT encountered in practice. Some such results 
already exist; see [6] for a criterion on the random graph process to almost 
surely produce projective graphs (if G is projective then EXT(G) is NP- 
complete, see [2]). Our guess is that both CSP and EXT will remain to be 
almost surely NP-complete in all the nontrivial cases. 
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